Blind speech separation of moving speakers in real reverberant environments

نویسندگان

  • Athanasios Koutras
  • Evangelos Dermatas
  • George K. Kokkinakis
چکیده

In this paper we present a new on-line Blind Signal Separation method capable to separate convolutive speech signals of moving speakers in highly reverberant rooms. The separation network used is a recurrent network which performs separation of convolutive speech mixtures in the time domain, without any prior knowledge of the propagation media, based on the Maximum Likelihood Estimation (MLE) principle. The proposed method proved to be able to improve significantly (more than 10% in all adverse mixing situations) the performance of a continuous phoneme-based speech recognition system and therefore can be used as a front-end to separate simultaneous speech of speakers who are moving in arbitrary directions in reverberant rooms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Blind speech separation of moving speakers using hybrid neural networks

In this paper we present a novel method for Blind Speech Separation of convolutive speech signals of moving speakers in highly reverberant rooms. The separation network used is a hybrid neural network, which performs separation of convolutive speech mixtures in the time domain, without any prior knowledge of the propagation media, based on the Maximum Likelihood Estimation (MLE) principle. The ...

متن کامل

Simultaneous speech recognition in noisy reverberant environme

In this paper, we examine the robustness of a Blind Signal Separation (BSS) technique in the time domain, based on a recurrent neural network, for separating multiple competing speakers in real reverberant environments. The separation network’s learning rule is based on the Maximum Likelihood Estimation criterion and was tested in real room situations in a noise-free and a noisy reverberant env...

متن کامل

Online blind speech separation using multiple acoustic speaker tracking and time-frequency masking

Separating speech signals of multiple simultaneous talkers in a reverberant enclosure is known as the cocktail party problem. In real-time applications online solutions capable of separating the signals as they are observed are required in contrast to separating the signals offline after observation. Often a talker may move, which should also be considered by the separation system. This work pr...

متن کامل

Applying Blind Signal Separation to the Recognition of Overlapped Speech

Blind signal separation method based on minimizing mutual information is applied to deal with multispeaker problem in speech recognition. Recognition experiments performed under di erent acoustic environments, in a soundproof room and a reverberant room, clarify that 1) the method can improve recognition accuracy by about 20% where SNR condition is 0 dB, 2) the method is more e ective when many...

متن کامل

Reverberation-Robust Online Multi-Speaker Tracking by Using a Microphone Array and CASA Processing

Online tracking of speakers is an important task for applications in smart environments such as camera control, meeting annotation and speech separation. Challenges for an audio-only system are small-room reverberation, noise, the unknown number of speakers, and gaps occurring in natural speech. Combining models from neurobiology and cognitive psychology with many-channel signal processing and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000